Picture for Fanyi Xiao

Fanyi Xiao

Sparse CLIP: Co-Optimizing Interpretability and Performance in Contrastive Learning

Add code
Jan 27, 2026
Viaarxiv icon

Benchmarking Egocentric Multimodal Goal Inference for Assistive Wearable Agents

Add code
Oct 25, 2025
Viaarxiv icon

EdgeTAM: On-Device Track Anything Model

Add code
Jan 13, 2025
Viaarxiv icon

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Add code
Oct 22, 2024
Figure 1 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 2 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 3 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 4 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Viaarxiv icon

Gen2Det: Generate to Detect

Add code
Dec 07, 2023
Viaarxiv icon

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images

Add code
Dec 04, 2023
Figure 1 for Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Figure 2 for Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Figure 3 for Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Figure 4 for Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Viaarxiv icon

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Add code
Dec 01, 2023
Figure 1 for EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Figure 2 for EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Figure 3 for EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Figure 4 for EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Viaarxiv icon

EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding

Add code
Sep 15, 2023
Figure 1 for EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
Figure 2 for EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
Figure 3 for EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
Figure 4 for EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
Viaarxiv icon

Exploring Open-Vocabulary Semantic Segmentation without Human Labels

Add code
Jun 01, 2023
Viaarxiv icon

Going Denser with Open-Vocabulary Part Segmentation

Add code
May 18, 2023
Viaarxiv icon